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1 MVAAAAATEA RLRRRTAATA ALAGRSGGPH CVNGGRCNPG TGQCVCPAGW 
51 VGEQCQHCGG RFR LTGSSGF VTDGPGNYK Y KTKCTWLIEG QPNRIMRLRF 
10i NHFATECSWD HLYVYDGDSI YAPLVAAFSG LIVPERDGNE TVPEVVATSG 
151 YALLHFFSDA AYNLTGFNIT YSFDMCPNNC SGRGECKISN SSETVECECS 
201 ENWK GEACDI P HCTDNCGFP HRGICNSSDV- RGCSCFSDWQ GPGCSVPVPA 
251 NQSFWTREEY SNLKLPRASH KAVVNGNIMW WGGYMFNHS DYNMVLAYDL 
301 AS REWLPLNR SVNNVWRYG HSLALYK DKI YMYGGKIDPT GNVTNELRVF 
351 HIHNESWVLL TPKAKEQYAV VGHSAHIVTL KNGRWMLVI FGHCPLYGYI 
401 SNVQEYDLDK NTWSILHTQG ALVQGGYGHS SVYDHRT RAL YVHGGYK AFS 
451 ANKYRLADDL Y RYDVDTQMW TILK DSRFFR YLHTAVIVSG TMLVFGGNTH 
' 501 NDTSMSHGAK CFSSDFMAYD IACDRWSVLP RPDLHHDVNR FGHSAVLHNS 
551 TMYVFGGFNS LLLSDILVFT SEQCDAHRSE AACLAAGPGI RCVWNTGSSQ 
601 CISWALATDE QEEKLKSECF c.orT.nHnRr DOHTDCYSCT ANTNDCHWCN 
651 DHCVPRNHSC SEGQISIF° v ^rPKDNPMY YCNKKTSCRS CALDQNCQWE 
701 PRNQECIALP ENICGIGWHL VGNSCLKITT AKENYDNAKL FCRNHNALLA 
751 SLTTQKKVEF VLKQLRIMQS SOSMS KLTLT PWVGLR KINV SYWCWEDMSP 
801 FTNSLLQWMP SEPSDAGFCG ILSEPSTRGL KAATCINPLN GSVCERPANH 
851 SAKQCRTPCA LRTACGDCTS. GSSECMWCSN MKQCVDSNAY VASFPFGQCM 
901 EWYTMSTCPP ENCSGYCTCS HCLEQPGCGW CTDPSNTGKG KCIEGSYKGP 
951 VKMPSQAPTG NFYPQPLLNS SMCLEDSRYN WSFIHCPACQ CNGHSKCINQ 
1001 SICEKCENLT TGKHCETCIS GFYGDPTNGG KCQPCKCNGH ASLCNTNTGK 
1051 CFCTTKGVKG DECQLCEVEN RYQGNPLRG T CYYTLLIDYQ FTFSLSQEDD 
1101 RYYTAINFVA TPDEQNRDLD mptm.^KFM LNITWAASFS AGTQAGEEMP 
U51 w<ttTMTKEY KDSFSNEKFD FRNHPNITFF wvsWFTWPI KIQVQTEQ 



WO 00/156: 



4 / 20 



PCT/US99/20948 




< 



WO 00/1565 



5 / 20 



09/7870 

PCT/US99/20948 



A 



Kozak sequence v x \ 

GggAjigATGG (££CQ * £ MP; 2b; 



Open Reading Frame 



l'NE.1 pot^A) 



3504 3904 



R84298 EST 



Mlel 



Northern probe 



EooNt 



(Hindiii) Library probe 



pks^3-1 



Clal 



pks-43 



fU 



6 



boar*, 
bindng 



Att/actjn protein domains 



1199 amino acids 



fas? 
: r 



Conserved cysteines 



EOF 



F33C8.1 protein domains 



1291 amino acids 



SSCL \D 



Attractin 



Minimum serine protease 

- # -x-x-x- a-x-x-x- # -x ( 1 o ) 



Prolyl oligo 
peptidase 



Trypsin 




WO 00/156: 



6 



/ 20 



09/787097 

PCT/US99/20948 




09/787097 

. WQ 00/1565^^. PCTAJS99/20948 



8 / 20 




t4- 

u- 



Q " O q O 'J 



WO 00/156; 



9 / 20 



09/78709 

PCT/US99/20948 



1 


ATGGTGGCCG 


CAGCGGCGGC 


AACTGAGGCA 


AGGCTGAGGA 


GGAGGACGGC 


51 


GGCGACGGCA 


GCGCTCGCGG 


GCAGGAGCGG 


CGGGCCGCAC 


TGTGTCAACG 


101 


GCGGTCGCTG 


CAACCCTGGC 


ACCGGCCAGT 


GCGTCTGCCC 


CGCCGGCTGG 


151 


GTGGGCGAGC 


AATGCCAGCA 


CTGCGGGGGC 


CGCTTCAGAC 


TAACTGGATC 


201 


TTCTGGGTTT 


GTGACAGATG 


GACCTGGAAA 


TTATAAATAC 


AAAACGAAGT 


251 


GCACGTGGCT 


CATTGAAGGA 


CAGCCAAATA 


GAATAATGAG 


ACTTCGTTTC 


301 


AATCATTTTG 


CTACAGAGTG 


TAGTTGGGAC 


CATTTATATG 


TTTATGATGG 


351 


GGACTCAATT 


TATGCACCGC 


TAGTTGCTGC 


ATTTAGTGGC 


CTCATTGTTC 


401 


CTGAGAGAGA 


TGGCAATGAG 


ACTGTCCCTG 


AGGTTGTTGC 


CACATCAGGT 


451 


TATGCCTTGC 


TGCATTTTTT 


TAGTGATGCT 


GCTTATAATT 


TGACTGGATT 


501 


TAATATTACT 


TACAGTTTTG 


ATATGTGTcC 


AAATAACTGC 


TCAGGcCGAG 


551 


GAGAGTGTAA 


GATCAGTAAT 


AGCAGCGAAA 


CTGTTGAATG 


TGAATGTTCT 


601 


GAAAACTGGA 


AAGGTGAAGC 


ATGTGACATT 


CCTCACTGTA 


CAGACAACTG 


651 


TGGTTTTCCT 


CATCGAGGCA 


TCTGCAATTC 


AAGTGATGTC 


AGAGGATGCT 


701 


CCTGCTTCTC 


AGACTGGCAG 


GGTCCTGGAT 


GTTCAGTTCC 


TGTACCAGCT 


751 


AACCAGTCAT 


TTTGGACTCG AGAGGAATAT 


TCTAACTTAA 


AGCTCCCCAG 


801 


AGCATCTCAT 


AAAGCTGTGG 


TCAATGGAAA 


CATTATGTGG 


GTTGTTGGAG 


851 


GATATATGTT 


CAACCACTCA 


GATTATAACA 


TGGTTCTAGC 


GTATGACCTT 


901 


GCTTCTAGGG AGTGGCTTCC 


ACTAAACCGT 


TCTGTGAACA 


ATGTGGTTGT 


951 


TAGATATGGT 


CATTCTTTGG 


CATTATACAA GGATAAAATT 


TACATGTATG 


1001 


GAGGAAAAAT 


TGATCCAACT 


GGGAATGTGA 


CCAATGAGTT GAGAGTTTTT 


1051 


CACATTCATA ATGAGTCATG 


GGTGTTGTTG 


ACCCCTAAGG 


CAAAGGAGCA 


1101 


GTATGCAGTG 


GTTGGGCACT 


CTGCACACAT 


TGTTACACTG AAGAATGGCC 


1151 


GAGTGGTCAT GCTGGTCATC 


TTTGGTCACT 


GCCCTCTCTA 


TGGATATATA 


1201 


AGCAATGTGC 


AGGAATATGA 


TTTGGATAAG 


AACACATGGA 


GTATATTACA 


1251 


CACCCAGGGT 


GCCCTTGTGC 


AAGGGGGTTA 


CGGCCATAGC 


! AGTGTTTACG 


1301 


ACCATAGGAC 


CAGGGCCCTA 


TACGTTCATG 


GTGGCTACAA 


, GG CTTTCAGT 
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1351 GCCAATAAGT ACCGGCTTGC AGATGATCTC TACCGATATG ATGTGGATAC 
1401 CCAGATGTGG ACCATTCTTA AGGACAGCCG ATTTTTCCGT T ACTTG C AC A 
1451 CAGCTGTGAT AGTGAGTGGA ACCATGCTGG TGTTTGGGGG AAACACACAC 
1501 AATGACACAT CTATGAGCCA TGGCGCCAAA TGCTTCTCTT CAGATTTCAT 
1551 GGCCTATGAC ATTGCCTGTG ACCGCTGGTC AGTGCTTCCC AGACCTGATc 
1601 TCCACCATGA TGTCAACAGA TTTGGCCATT CAGCAGTCTT ACACAACAGC 
1651 ACCATGTATG TGTTCGGTGG TTTCAATAGT CTCCTCCTCA GCGACATCCT 
1701 GGTATTCACC TCGGAACAGT GTGATGCGCA TCGGAGTGAA GCCGCTTGTT 
1751 TAGCAGCAGG ACCTGGTATT CGGTGTGTGT GGAACACAGG GTCGTCTCAG 
1801 TGTATCTCGT GGGCGCTGGC AACTGATGAA CAAGAAGAAA AGTTAAAATC 
1851 AGAATGTTTT TC CAAAAG AA CTCTTGACCA TGACAGATGT GACCAGCACA 
1901 CAGATTGTTA CAGCTGtACA GCCAACACCA ATGACTGCCA CTGGTGCAAT 
1951 GACCATTGTG TCCCCAGGAA CCACAGCTGC TCAGAAGGCC AGATCTCCAT 
2001 TTTTAGGTAT GAGAATTGCC CCAAGGATAA CCCcATGTAC TACTGTAACA 
2051 AGAAGACCAG CTGCAGGAGC TGTGCCCTGG ACCAGAACTG CCAGTGGGAG 
2101 CCCCGGAATC AGGAGTGCAT TGCCCTGCCC GAAAATATCT GTGGCATTGG 
2151 CTGGCATTTG GTTGGAAACT CATGTTTGAA AATTACTACT GCCAAGGAGA 
2201 ATTATGACAA TGCTAAATTG TTCTGTAGGA ACCACAATGC CCTTTTGGCT 
2251 TCTCTTACAA CCCAGAAGAA GGTAGAATTT GTCCTTAAGC AGCTGCGAAT 
2301 AATGCAGTCA TCTCAGAGCA TGTCCAAGCT CACCTTAACC CCATGGGTCG 
2351 GCCTTCGGAA GATCAATGTG TCCTACTGGT GCTGGGAAGA TATGTCCCCA 
2401 TTTACAAATA GTTTACTACA GTGGATGCCG TCTGAGCCCA GTGATGCTGG 
2451 ATTCTGTGGA ATTTTATCAG AACCCAGTAC TCGGGGACTG AAGGCTGCAA 
2501 CCTGCATCAA CCCACTCAAT GGTAGTGTCT GTGAAAGGCC TGCAAACCAC 
2551 AGTGCTAAGC AGTGCCGGAC ACCATGTGCC TTGAGGACAG CATGTGGAGA 
2601 TTGCACCAGC GGCAGCTCTG AGTGCATGTG GTGCAGCAAC ATGAAGCAGT 
2651 GTGTGGACTC CAATGCCTAT GTGGCCTCCT TCCCTTTTGG CCAGTGTATG 



09/787097 

11/20 



WO 00/15^ PCT/US99/20948 



2701 


GAATGGTATA 


CGATGAGCAC 


pTr h c C C C CT 


GAAAATTGTT CAGGCTACTG 


2751 


TACCTGTAGT 


CATTGCTTG\j 


ALj L-AAV- y — r\KJ\J 


CTGTGGCTGG TGTACTGATC 


2801 


CCAGCAATAC 


TGGCAAAGGG 


AAA ioUn J- M.O 


AGGGTTCCTA TAAAGGACCA 


2851 


GTGAAGATGC 


CTTCGCAAGC 


CL-L- 1 Ak—rivTVJrt 


AATTTCTATC CACAGCCCCT 


2901 


GCTCAATTCC 


AGCATGTGTC 




CAGATACAAC TGGTCTTTCA 


2951 


TTCACTGTCC 


AGCTTGCCAA 




ACAGTAAATG CATCAATCAG 


3001 


AGCATCTGTG 


AGAAGTGTGA 


a t\ r* r~"m a rr 
GAAL-\- Iw^-*- 


ACAGGCAAGC ACTGCGAGAC 


3051 


CTGCATATCT 


GGCTTCTACG 


GTGA I UUL-fi^ 


CAATGGAGGG AAATGTCAGC 


3101 


CATGCAAGTG 


CAATGGGCAC 


/~« r"r*> •T♦^ , ♦T , ^ , 'TY3 r T , 

GCGT(- i U ivj i 


GCAACACCAA CACGGGCAAG 


3151 


TGCTTCTGCA 


CCACCAAGGG 


CG T CAAvjvj<j<j 


GACGAGTGCC AGCTATGTGA 


3201 


GGTAGAAAAT 


CGATACCAAG 


GAAACLL1L1 


CAGAGGAACA TGTTATTATA 


3251 


CTCTTCTTAT TGACTATCAG 


TTCACCTT 1A 


aTCTATCCCA GGAAGATGAT 


3301 


CGCTATTACA 


CAGCTATCAA 


TTTTGTGGu I 


nrTrCTGACG AACAAAACAG 


3351 


GGATTTGGAC 


ATGTTCATCA 


ATGCL. I Uv— rt-B- 


GAATTTCAAC CTCAACATGA 


3401 


CCTGGGCTGC 


CAGTTTCTCA 


GCtGGAACCC 


AGGCTGGAGA AGAGATGCCT 


3451 


GTTGTTTCAA 


AAACCAACAT 


TAAGGAGTAC 


AAAGATAGTT TCTCTAATGA 


3501 


GAAGTTTGAT 


TTTCGCAACC 


AC CCAAATAT 


CACTTTCTTT GTTTATGTCA 


3551 


GTAATTTCAC 


CTGGCCCATC 


AAAATTCAGG 


TGCAAACTGA ACAATGA 
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FIGURE 9 



MVAAAAATEA RL^TA £AGR § GGPH ™C« G . = aW 
SUS «SS ^2£sG LIVKgGg TVPSWATSG 

S ESS SSS 

i» SS S^PRASH S VW ?S HW 0™DL 
& fflSS SSS K^WHLVX FGHCP^I 

JSJ SNVQEYDLDK TMLVFGGNTH 

451 ""SS^X SSSX IAcS^P RPDLHHDTOR FGHSAVLHNS 
501 NDTSMSHGAK c ffS°™AYD "^"g AACLAAGPGI RCVWNTGSSQ 
551 TMYVFGGFNS LLLSDILVFT |MCTW«bt ANTNDCHWCN 

iss» 1 S sss 

SSSSSS I fi I CTDPSKTGKG 2KSS 

& =S §1 111 ilil iiil 

ii = ill lil S3 ISi 

1201 NFMDLVQFFV TFFSCFLSLL LVAAVWKl^ piALEPCFGN KAAVLSVFVR 

[ill SSSffl ESSSS ISSiSSS ?™«v 

1351 PGTCI 



101 
151 
201 



601 
651 
701 
751 
301 
851 
901 
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FIGURE I OA 



i acqocaacca cagcggcggc aaccgaggca aggc-gagga ggaggacggc 

51 qgcgacggca gcgcccgcgg gcaggagcgg cgggccgcac cgcgccaacg 

1Q1 gcggcclccg caaccccggc accggccagc gcg-ctgccc cgccggccgg 

151 qcgggcgaac aacgccagca ccgcgggggc cgccccagac caaccggacc 

2of EccEaggttt gcaacagacg gacccggaaa tca-aaacac aaaacgaagc 

25l gcacaEggcc caEcgaagga cagccaaata gaacaatgag acttcgtccc 

301 LtcatEEcg ctacagagcg C ag«gggac cacctatatg "catgacgg 

351 ggactcaatt tacgcaccgc cagccgczgc acc-agtggc cccactgtcc 

401 Scgagagaaa ccgcaacgag accgtccccg aggcrgtcgc cacatcagg. 

I 01 tacgccccgc t^cacctctt cagcgacgct gccrataatt tgaccggacc 

501 taatattacc cacagctttg acatgtgccc aaanaactgc tcaggccgag 

551 gagagtgtaa gatcagtaat agcagcgaca ctgztgaatg tgaatgtcct 

601 gaaalctgga aaggcgaagc acgtgacact ccccactgta cagacaaccg 

651 ?ggc«tI2t caccgaggca tctgcaaccc aagcgatgcc agaggatgcc 

701 cllgcctctc agactggcag ggtcctggat gtccagttcc tgtaccagcc 

751 aac?agtcat tttggactcg agaggaatat tctaacttaa agctccccag 

801 agcaccccat aaagccgtgg tcaatggaaa canratgtgg gntgttggag 

851 gScatatgce caaccactca gattataaca cgczzctagc gcatgacct: 

5 acctc-aggg aacggcttcc actaaaccgc tc^gcgaaca acgcggccgc 

HI fSaca?H? catlltttgg cattatacaa ggacaaaatt tacatgtatg 

loJl giggaaallt tgacccaall gggaatgcga ccaatgagtt gagagtttrt 

loll XS«cata aEgagtcatg ggtgttgttg acccctaagg caaaggagca 

litl ataScagtg gttgggcact ctgcacacat tgccacactg aagaatggcc 

llll gagtS?a? |ctgi?catc tttggtcact gccctctcta cggatatata 

1201 !S?a!?gtgc Iggaatatga tttggataag aacacatgga gtatattaca 

1251 clcccagggt gwcttgtgc- aagggggcta cggccatagc agtgtttacg 

1301 accatagglc 8agggcccta tacgttcatg gtggctacaa ggctttcagt 

1351 qccaatfagt acllgcttgc agatgatctc taccgatatg atgtggatac 

llll ?caqatgtgg accaEtctta aggacagccg atctttccgt tacttgcaca 

llll cagc?g?g!? agtgagtgga accatgccgg tgtctggggg aaacacacac 

Itll HtgalaLt cfalgagcca tggcgccaaa tgcctctctt cagatttcat 

1551 qqcltacgac attgcctgtg accgctggcc agtgcttccc agacctgatc 

llll Sacca?ga tgt?aacaga cctggccact cagcagcctt acacaacagc 

1651 acclcgtalg tgtccggtgg cctcaacagt ctccccctca gcgacatcct 

llll ggtallcacl tlggaacagt gtgatgcgca tcggagtgaa gccgctcgtt 

1751 Eigcagcagg acctggtatt cggtgtgtgt ggaacacagg gtcgtctcag 

llll tgtSctcit gggcgctggc aactgatgaa caagaagaaa agttaaaatc 

"si afaltgtttt El?aaaagaa ctcttgacca tgacagatgt gaccagcaca 

llll clgat?gtta cagctgcaca gccaacacca atgactgcca ctggtgcaat 

1951 qaccactgtg tccccaggaa ccacagctgc tcagaaggcc agatctccat 

2?oi tSSIggtat gagaatEgcc ccaaggacaa ccccatgtac taccgtaaca 

loll agaagScag ctgcaggagc tgtgccctgg accagaactg ccagtgggag 

llll SSSgaatl aggagcgcat cgccctgccc gaaaatatcc gtggcattgg 

llll ctq^atttg gEEggaaact catgttcgaa aatcactact gccaaggaga 

22" at?!tgaca! IgcSaattg ttctgtagga accacaacgc ccttttggct 

llll ?ctSt?acaa cEcagaagaa ggtagaatct gtccctaagc agctgcgaat 

23oJ aatgcagtca cctcagagca tgtccaagct cacc-taacc ccacgggtcg 

llll qcc?tcggaa gatcaltgtg ccctactggc gccgggaaga tatgtcccca 

llll ?ttacaSta gtttactica gtggacgccg tctgagccca Qtgatgccgg 

2451 attccgcgga atcttaccag aacccagtac ccggggactg aaggctgcaa 

Isll- SSgcSSa cccactcaa? ggtagcgtct gtgaaaggcc tgcaaaccac 

255^ agtgccaagc agcgccggac accacgtgcc ttgaggacag cacgtggaga 

26" t?g?accagc glclgclctg agcgcacgcg gtgcagcaac acgaagcagt 
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FIGURE 10B 
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2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
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3151 
3201 
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3401 
3451 
3501 
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3601 
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3701 
3751 
3801 
3851 
3901 
3951 
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4051 



gtgtggacnc 
gaatggcaca 
Lacccgcagc 
ccagcaacac 
gcgaagacgc 
gctcaacccc 
cccaccgccc 
agcacctgcg 
ctgcatacct 
catgcaagtg 
tgcttctgca 
ggtagaaaat 
ctcttcttat 
cgctattaca 
ggatttggac 
cctgggccgc 
gttgttccaa 
gaagtttgac 
gtaatctcac 
aattttatgg 
ccctttgctc 
gggcctccag 
agccgtccct 
tcctgatctt 
tggagccgtg 
ctccctcgag 
tgtggccagc 
aggagaagtc 
cctgggacct 



caacgcccac 
cgacgagcac 
caccgcctgg 
cggcaaaggg 
ctccgcaagc 
agcangtgcc 
agctcgccaa 
agaagcgtga 
ggcttctacg 
caatgggcac 
ccaccaaggg 
cgataccaag 
tgactatcag 
cagccatcaa 
atgttcatca 
cagtttctca 
aaaccaacac 
tttcgcaacc 
ctggcccatc 
acctggtaca 
ctggtggctg 
acgtagagag 
ttgcctctgt 
attgggggga 
ttttggcaac 
gcctgggtgg 
gccctggtgg 
aggagccgtg 
gcatctga 



gcggcccccc 
ctqccccccz 
agcaaccagg 
aaatgcanag 
ccctacagga 
cagaggacag 
cgcaacggcc 
gaacctgacc 
gcgaccccac 
gcgtctctgt 
cgtcaagggg 
gaaacccccc 
tccaccctta 
ctttgcggct 
atgcctccaa 
gctggaaccc 
taaggagcac 
acccaaacat 
aaaactcaga 
gttcttcgtg 
ctgtggtttg 
caacttcttc 
aaatgtcgcc 
gtacaaagac 
aaagccgctg 
catccctcct 
acatttctca 
agaaaccgga 



tccc-ctcgg 
gaaaattgcc 
ccgtggctgg 
agggccccna 
aaccnctacc 
cagacacaac 
acagcaaang 
acaggcaagc 
caacggaggg 
gcaacaccaa 
gacgagcgcc 
cagaggaaca 
gcctatccca 
accccngacg 
gaacctcaac 
aggcnggaga 
aaagatagcc 
cacuttcttt 
ccgccttccc 
actttcttca 
gaagatcaaa 
gagagatgca 
ttggaaacag 
tgttcccaaa 
tcctctctgt 
cctgggcagt 
gcagatgccg 
agcagcagcc 



ccagcgtang 
caggccaccg 
cgnactgacc 
caaaggacca 
cacagcccct 
cggtctccca 
caccaatcag 
accgcgagac' 
aaatgtcagc 
cacgggcaag 
agccatgcga 
tgctattata 
ggaagatgac 
aacaaaacag 
ctcaacacca 
agagatgccc 
tctctaatga 
grttacgcca 
tzcagcacagc 
gttgtttcct 
caaagttgcr 
acagatggcc 
atgaggagcc 
cccactgcac 
gtttgtgagg 
caggtcttgc 
atagtgtaca 
ccctgcacag 
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FIGURE 1 1 

WAAAAAT£ARLR?.?.TAATA.^I^G?.SGGPmTOlvT)VTRAGRPGLGAGL?.LF?.I.13PPLR 
PRLLLLLLLLPPPLLLLLLPCEA^AAAAAAAVSGSAAAEAKECDP.PCWGGP.CNPGTG 
QCVCPAGVn?GEQCQHCGGRFRLTGS3GFVTDGPGMYKYKTKCr.-JLIEGQPl-!?.I?4RLRF 
OTFATECSWDKLr^/DGDSIYAPLVAAFSGLIVFERDGNETVPE'/VATSGYALLHFFS 
DAATOLTGFHITVSFDMCPHHCSGHGECKISNSSETVECECSEHWKGEACDISHCTDN 
CGFPHRGICHSSDVRGCSCFSDWQGPGCSVPVPANQSFWTREEYSHLKLPSASHKAW 
NGMIMWWGGYMFNHSDYWMVLATOLASREWLPL1TOSVNNVVVP.VGHS LALYKDKIYM 
YGGKIDPTGNVTNELRVFHIHNESWVLLTPKAKEQYAWGHSAHIVTLKNGRVVMLVI 
FGHCPLYGYISNVQEYDLDKNTOSILHTQGALVQGGYGHSSVYDHRTRALT.'HGGYKA 
FSANKYP.LADDLYRYDVDTQMWTILKDSRFFRYLHTAVIVSGTMLVFGGNTKTDTSMS 
D HGAKCFSSDFMAYBIACDRWSVLPRPDLHHDVNRFGHSAVLHilSTMYVFGGFITSLLLS 
•fi DILVFTSEQCDAHRSEAACtiAAGPGIRCVWNTGSSQCISWALATDEQEEKI.y.3ECFSK 
03 RTLDHDRCDQHTDCYSCTANTNDCH'.'ICNDHCVPRNHSCSEGOISIFRYENCPI-aJNPMY 
YCNKKTSCRSCALDQNCQWEPRNQECIALPENICGIGWHLVGNSCLKITTAKENYDNA 
5 KLFCRNHNAL1ASLTTQKKVEFVLKQLRIMQSSQSMSKLTLTPWVGLRKINVSYWCWE 
y DMSPFTNSLLQWMPSEPSDAGFCGILSEPSTRGLKAATCINPLNGSVCERPANHSAKQ 
p CRTPCALRTACGDCTSGSSECMWCSNMKQCVDSNAYVASFPFGQCMEWYTMSTCPPEN 
SI CSGYCX CSHCLEQPGCGWCTDPSNTGKGKCIEGSYKGPVKMPSQAPTGNFYPQPLLNS 
sj SMCLEDSRYNWSFIHCPACQCNGHSKCINQSICEKCENLTTGKHCETCISGFYGDPTN 
P GGKCQPCKCNGHASLCNTNTGKCFCTTKGVKGDECQLCEVENRYQGNPLRGTCYYTLL 
IDYQFTFS LSQEDDRYYTAINF\'ATPDEQNRDLDMFINASKNFNLNITWAASF3AGTC 
AGEEMPWSKTNIKEYKDSFSNEKFDFRNHPNITFFVYVSNFTWPIKIQVQTEQ 



WO 00/1561 



16 / 20 



09/787097 

PCT/US99/20948 



FIGURE 12 



I atggcggccg cagcggcggc aactgaggca 
61 gcgctcgcgg gcaggagccg cgggccgcac 
121 ccggggctgg gggccgggct gcgcctcccg 
181 ccgctgccgc tgctgttgct gctcccgccg 
241 gccgaggccg cggcggcggc ggcggcggcg 
301 cgtgaccggc cccgcgccaa cggcggccgc 
361 cccgccggcr gggcgggcga gcaatgccag 
421 tcttctgggc ctgcgacaga cggacccgga 
431 ctcaccgaag gacagccaaa tagaataatg 
541 cgcagctggg accacctaca tgtttatgat 
601 gcacttagcg gcctcaccgt tcctgagaga 
661 gccacatcag gttatgcctt gccgcactct 
721 tttaacatta cttacagcrt tgatatgtgt 
781 aagatcagta acagcagcga aactgttgaa 
841 gcacgtgaca ttcctcactg tacagacaac 
901 tcaagtgatg tcagaggatg ctcctgcttc 
961 cctgtaccag ctaaccagtc attttggact 
1021 agagcatctc ataaagctgt ggtcaatgga 
1081 ttcaaccacc cagattataa catggttcta 
1141 ccactaaacc gtcctgtgaa caatgtggcc 
1201 aaggataaaa cttacatgta cggaggaaaa 
1261 tcgagagttt ttcacactca caacgagcca 
1321 cagcatgcag cggtcgggca ctctgcacac 
1381 acgctggtca tctttggtca ctgccctcrc 
1441 gacttggaca agaacacatg gagtatacta 
1501 tacggccaca gcagtgttta cgaccatagg 
1561 aaggctttca gtgccaataa gtaccggcct 
1621 acccagatgt ggaccatrct taaggacagc 
1681 atagtgagtg gaaccatgct ggtgtttggg 
1741 catggcgcca aatgcttctc ttcagatttc 
1801 tcagtgcttc ccagacctga tctccaccat 
1861 ttacacaaca gcaccatgta tgtgttcggn 
1921 ctggtattca cctcggaaca gtgtgatgcg 
1981 ggacctggta ttcggtgtgt gtggaacaca 
2041 gcaactgatg aacaagaaga aaagttaaaa 
2101 catgacagat gtgaccagca cacagaccgt 
2161 cactggtgca atgaccattg tgtccccagg 
2221 atttttaggt atgagaattg ccccaaggac 
2281 agcngcagga gctgtgccct ggaccagaac 
2341 actgcccngc ccgaaaacac ctgtggcatc 
2401 aaaattacca ctgccaagga gaattatgac 
2461 gcccttttgg cttctcttac aacccagaag 
2521 ataatgcagt catctcagag cacgtccaag 
2581 aagatcaatg tgtcctactg gtgctgggaa 
2641 cagtggatgc cgtctgagcc cagcgacgcc 
2701 actcggggac tgaaggctgc aacctgcacc 
2761 cctgcaaacc acagtgctaa gcagtgccgg 
2821 gattgcacca gcggcagctc tgagtgcatg 
2881 tccaatgcct acgtggcctc cttccctcrt 
2941 acctgccccc ctgaaaattg ctcaggccac 
3001 ggctgtggct ggtgtactga tcccagcaat 
3061 tacaaaggac cagtgaagat gccntcgcaa 
3121 ctgctcaatt ccagcacgtg tccagaggac 
3181 ccagcctgcc aatgcaacgg ccacagcaaa 
3241 gagaacctga ccacaggcaa gcactgcgag 
3301 accaacggag ggaaatgtca gccatgcaag 
3 361 aacacgggca agtgcttctg caccaccaag 
3421 gaggtagaaa atcgatacca aggaaacccc 
3481 attgactatc agttcacctt tagtctatcc 
3 541 aactttgtgg ctactcccga cgaacaaaac 
3601 aagaatttca acctcaacat cacccgggcc 
3661 gaagagatgc ctgttgtctc aaaaaccaac 
3721 gagaagcttg attttcgcaa ccacccaaat 
3781 acccggccca ccaaaactca ggcgcaaacn 



aggccgagga ggaggacggc ggcgacggca 
tgggac-ggg acgcgaccag ggcigggagg 
cggccgccgt czccaccqcz gcggccacgg 
ccgccgncgc tgcrgccgcr gccccgcgag 
tcgggcccag ccgcagccga ggccaaggaa 
cgcaaccctg gcaccggcca gtgcgcccgc 
cactgcgggg gccgcttcag accaacngga 
aaccacaaac acaaaacgaa gcgcacgcgg 
agact"gtr tcaatcactt cgccacagag 
ggggacccaa cctacgcacc gcragttgct 
gatggcaatg agactgtccc tgaggttgtt 
ctcagcgacg ccgcntataa tctgaccgga 
ccaaauaacc gcccaggccg aggagagcgt 
cgcgaangct ccgaaaactg gaaaggcgaa 
tgtggttttc ctcatcgagg cacccgcaat 
ccagactggc agggtcctgg atgcccagtt 
cgagaggaat attccaactt aaagctcccc 
aacatcatgt gggttgttgg aggacatatg 
gcgcacgacc ctgcttctag ggagtggctt 
gccagacacg gtcattcttt ggcaccatac 
accgacccaa ccgggaatgt gaccaatgag 
tgggcgccgt tgacccccaa ggcaaaggag 
accgucacac cgaagaatgg ccgagtggtc 
cacggacata taagcaatgt gcaggaacat 
cacacccagg gcgcccttgt gcaagggggt 
accagggccc tacacgctca cggtggctac 
ccagacgatc tctaccgata tgatgtggat 
cgattrttcc gttacttgca cacagccgtg 
ggaaacacac acaatgacac atctatgagc 
atggcctatg acattgcctg tgaccgctgg 
gatgtcaaca gatttggcca ttcagcagtc 
ggtttcaata gtctcctcct cagcgacatc 
caccggagcg aagccgcttg tttagcagca 
gggccgtctc agtgtatctc gtgggcgctg 
tcagaatgtt tttccaaaag aactcttgac 
tacagcrgta cagccaacac caatgactgc 
aaccacagct gctcagaagg ccagatctcc 
aaccccatgt actactgtaa caagaagacc 
cgccagnggg agccccggaa tcaggagtgc 
ggctggcatt tggctggaaa cccatgtctg 
aangctaaat tgttctgcag gaaccacaac 
aaggcagaat ttgtccttaa gcagctgcga 
cccaccttaa ccccatgggt cggccttcgg 
gatacgcccc catttacaaa tagtttacta 
ggaccccgtg gaattttatc agaacccagt 
aacccactca atggtagtgt ctgtgaaagg 
acaccatgtg ccttgaggac agcatgtgga 
tzggtgcagca acatgaagca gtgtgtggac 
ggccagtgta tggaatggta tacgatgagc 
tgtacctgta gtcattgctt ggagcaacca 
accggcaaag ggaaatgcat agagggttcc 
gcccccacag gaaatttcta tccacagccc 
agcagataca actggtcttt cattcaccgt 
tgcaccaacc agagcacccg tgagaagtgt 
acctgcacat ccggcctcta cggtgatccc 
tgcaacgggc acgcgtctct gtgcaacacc 
ggcgtcaagg gggacgagtg ccagctatgt 
cccagaggaa cacgctatta tactcttctt 
caggaagatg atcgctatta cacagctatc 
agggacctgg acacgttcat caacgcctcc 
gccagrttct cagccggaac ccaggctgga 
accaaggagc acaaagatag tttctctaat 
accaccttct ctgcctatgn cagtaacttc 
gaacaacga 
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FIGURE 13 

1 MVAAAAATZA RLRRRTAATA ALAGRSGGPH WDWDVTRAGR PGLGAGLRLP 

51 RLLS?PLR D R LLLLLLLLPP PLLLLLLPCE AEAAAAAAAV SGSAAAEAKE 

101 CDRPCVNGGR CNPGTGQCVC PAGWVGEQCQ HCGGRFRLTG SSGFVTDGPG 

151 NYKYKTKC^VJ LIEGQPNRIM RLRFNHFATE CSWDHLYVYD GDSIYAPLVA 

201 AFSGLIVOER DGNETVPEW ATSGYALLHF FSDAAYNLTG FNITYSFDMC 

251 PNNCSGRGEC KISNSSETVE CECSENWKGE ACDIPHCTDN CGFPHRGICN 

301 SSDVRGCS"- SDWQGPGCSV PVPANQSFWT REEYSMLKLP RASHKAWNG 

351 NIMWWGGYM FNHSDYNMVL AYDLASREWL FLNRSVNNW VRYGHSLALY 

401 KDKIYMYGGK IDPTGNVTNE LRVFHIHNES WVLLTPKAKE QYAWGHSAH 

451 IVTLKNGRW MLVIFGHCPL YGYISNVQEY DLDKNTWSIL KTQGALVQGG 

501 YGHSSVYDHR TRALYVHGGY KAFSANKYRL ADDLYRYDVD TQMWTILKDS 

551 RFFRYLHTAV IVSGTMLVFG GNTHNDTSMS HGAKCFSSDF MAYDIACDRW 

601 SVLPRPDLHH DVNRFGHSAV LHNSTMYVFG GFNSLLL5DI LVFTSEQCDA 

651 HRSEAACLAA GPGIRCVWNT GSSQCISWAL ATDEQEEKLK SECFSKRTLD 

701 HDRCDQHTDC YSCTANTNDC HWCNDHCVPR NHSCSEGQIS IFRYENCPKD 

751 NPMYYCNKKT SCRSCALDQN CQWEPRNQEC IALPENICGI GWHLVGNSCL 

801 KITTAKENYD NAKLFCRNHN ALLASLTTQK KVEFVLKQLR IMQSSQSMSK 

8 51 LTLTPWVGLR KINVSYWCWE DMSPFTNSLL QWMPSEPSDA GFCGILSEPS 

901 TRGLKAAT^I NPLNGSVCER PANHSAKQCR TPCALRTACG DCTSGSSECM 

951 WCSNMKQC'/D SNAYVASFPF GQCMEWYTMS TCPPENCSGY CTCSHCLEQP 

1001 GCGWCTDPSN TGKGKCIEGS YKGPVKMPSQ APTGNFYPQP LLNSSMCLED 

1051 SRYNWSFIHC PACQCNGHSK CINQSICEKC ENLTTGKHCE TCISGFYGDP 

1101 TNGGKCQPCK CNGHASLCNT NTGKCFCTTK GVKGDECQLC EVENRYQGNP 

1151 LRGTCYYTLL IDYQFTFSLS QEDDRYYTAI NFVATPDEQN RDLDMFINAS 

1201 KNFNLNITWA ASFSAGTQAG EEMPWSKTN IKEYKDSFSN EKFDFRNHPN 

1251 ITFFVYVSNF TWPIKIQIAF SQHSNFMDLV QFFVTFFSCF LSLLLVAAW 

1301 WKIKQSCWAS RRREQLLREM QQMASRPFAS VNVALETDEE PPDLIGGSIK 

1351 TVPKPIALEP CFGNKAAVLS VFVRLPRGLG GIPPPGQSGL AVASALVDIS 

1401 QQMPIVYKEK SGAVRNRKQQ PPAQPGTCI 
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FIGURE 14A 



i atqacggccQ cagcggcggc aaccgaggca aggccgagga ggaggacggc 

51 qqcgacggca gcgcccgcgg gcaggagcgg cgggccgcac cgggaccggg 

101 Icqtgaccag ggctgggagg ccggggctgg gggccgggcc gcgcctcccg 

151 cggccgccgc ccccaccgcc gcggccacgg ccgccgczgc cgccgccgtc 

201 qctcccgccg ccgccgctgc tgccgctgct gccccg-gag gccgaggccg 

251 cqqcggcggc cgcggcggcg ccgggcccag ccgcagccga ggccaaggaa 

301 tgtgaccqgc Sccgcgccaa cggcggtcgc -gcaacc-g gcaccggcca 

351 qtgcgcctgc cccgccggct gggtgggcga gcaacgccag cactgcgggg 

401 gccg«tcag actaactgga ccttctgggc ctgcgacaga tggacctgga 

ASl aattacaaac acaaaacgaa gcgcacgcgg cccaccgaag gacagccaaa 

501 taqaataatg agacctcgct ccaatcacct cgccacagag tgtagtcggg 

551 accatctaca tgtctatgac ggggactcaa tctatgcacc gccagttgcc 

601 qcacttagtg gcctcattgt tcctgagaga gacggcaacg agactgcccc 

651 tqaggttgtc gccacatcag gtcatgcctt gctgcacttt cccagcgacg 

701 clgcttataa tttgactgga cttaatatta cttacagttt tgacatgtgt 

751 ccaaataact gctcaggccg aggagagtgt aagaccagca acagcagcga 
aactqttqaa tgcgaatgtt ctgaaaactg gaaagg-gaa gcacgcgaca 



801 



851 ccccccaccg cacagacaac tgtggtcctc ctcatcgagg cacctgcaat 

901 tcaagcgacg ccagaggacg ctcccgcttc ccagacrggc agggccccgg 

951 acgtEcagtt cccgtaccag ctaaccagtc actttggact cgagaggaac 

1001 attctaactt aaagcccccc agagcatctc ataaagc-gc ggccaatgga 

1051 aacattatgt gggttgttgg aggatatatg ttcaaccact cagattataa 

"oi wtggttcta gSgtatgacc ttgcttctag ggagtggctt ccactaaacc 

"si g?tl?gtgaa Saatgtggtt gttagatacg gtcatccrtt ggcactatac 

1201 aaggacaaaa tttacacgta tggaggaaaa attgatccaa ccgggaatgt 

1251 gaScaatgag ttgagagctt ttcacattca taatgagtca tgggtgttgt 

1301 fgacccclaa ggcaaaggag cagtatgcag tggttgggca ctctgcacac 

llll a??gttacac llaagaatgg ccgagtggtc atgctggrca tctttggtca 

1401 ctgccctctc tatggatata taagcaacgt gcaggaacat gacttggata 

1451 aqaacacatg gagtatatta cacacccagg gtgcccrcgt gcaagggggt 

1501 tacggccata gcagtgttta cgaccatagg accagggccc tacacgttca 

1551 tggcggctac aaggctttca gtgccaataa gtaccggcct gcagatgatc 

1601 tctaccgata tgacgtggat acccagatgt ggaccaccct taaggacagc 

1651 cqatttttcc gttactcgca cacagctgtg atagtgagcg gaaccatgct 

1701 qgtgtttggg ggaaacacac acaatgacac atctacgagc catggcgcca 

1751 Ilticttllc Etcagatttc atggcctatg acattgcccg tgaccgctgg 

1801 tcagtgcctc ccagacctga tctccaccat gatgtcaaca gatttggcca 

1851 ttcagcagtc ttacacaaca gcaccatgta tgtgtccggt ggtttcaata 

1901 gtctcctcct cagcgacatc ctggtattca cctcggaaca gtgtgatgcg 

1951 catcggagtg aagccgcttg tttagcagca ggacccggca ttcggtgtgt 

2001 gtggaacaca gggtcgtctc agtgtatctc gtgggcgccg gcaactgatg 

2051 aaliagaaga iaagttaaaa tcagaatgtt tttccaaaag aactcttgac 

2101 catgalaglt gtgaccagca cacagattgt tacagcsgta cagccaacac 

2151 caatgactgc cactggcgca atgaccattg cgtccccagg aaccacagct 

2201 gctclgaagg ccagatctcc atttttaggt atgagaatcg ccccaaggat 

2251 Lccclatgl actactgtaa caagaagacc agctgcagga gccgtgccct 

2301 ggaccagaac tgccagtggg agccccggaa tcaggagzgc ^"gccctgc 

2351 Ilgaaaatat ctgtggcatt ggctggcatt tggttggaaa "catgtttg 

2401 aalattacta ctgccaagga gaattatgac aatgccaaac tgttctgtag 

2451 gaaccacaat gcccttctgg cttctcttac aacccagaag aaggtagaat 

2501 ttgtccttaa gcagctgcga ataatgcagt catctcagag catgtccaag 

2551 cclaccttaa ccccatgggc cggcctccgg aagatcaacg tgccctactg 

2601 gcgccgggaa gatacgcccc catttacaaa cagttcacca cagtggatgc 
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Figure 14B 

2651 cgtctgaacc cagcgacacc ggactctgtg gaatcttatc agaacccagt 

2701 accccaggac -gaaggcngc aacccgcacc aacccaccca acggcagcgt 

2751 ctgcgaaaag cccgcaaacc acagtgccaa gcagcgccgg acaccacgtg 

2801 ccttqaagac aacacgcgga gatcgcacca gcggcagccc cgagtgcacg 

2851 cggtQcagca acacgaagca gtgtgcggac tccaacgccc acgtggcctc 

2901 cctcc-rccc qqccagcgta cggaatggca cacgacgagc acccgccccc 

2951 ccgaaaattg cccaggccac tgtacccgca gccaccgccc ggagcaacca 

3001 qgccgcggcc ggtgtactga ccccagcaat actggcaaag ggaaatgcat 

3051 aqagggtccc tataaaggac cagcgaagac gcctccgcaa gcccccacag 

3101 gaaacutcca cccacagccc ctgcccaaec ccagcacgcg cctagaggac 

315T agcaaataca accggncttc cattcaccgc ccagcccgcc aatgcaacgg 

3201 ccacaataaa tgcatcaatc agagcatctg cgagaagcgt gagaacccga 

3251 ccacaggcaa gcactgcgag acctgcatac ccggccccta cggtgacccc 

3301 accaacagag ggaaatgtca gccacgcaag tgcaacgggc acgcgcccct 

3351 qcgcaa^acc aacacgggca agtgcttccg caccaccaag ggcgccaagg 

3401 gggacgagtg ccagctatgc gaggtagaaa accgatacca aggaaaccct 

3451 ?ccagaggaa catgtcatta tactcttctc atcgaccatc agttcacctt 

3501 cagtccancc caggaaaacg accgccacca cacagctat-c aatttcgcgg 

3551 ccactcctaa cqaacaaaac agggatccgg acatgtccat caacgcctcc 

3601 aagaacttia acctcaacat cacctgggcc gccagcctct cagctggaac 

3651 ccaggctgga gaagagatgc ctgttgectc aaaaaccaac attaaggagt 

3701 acaaaqatag tctctccaat gagaagtttg attttcgcaa ccacccaaat 

3751 accactttct ctgtttatgt cagcaatttc acctggccca tcaaaattca 

3801 gattgccctc tctcagcaca gcaattttat ggacccggta cagttcttcg 

3851 tgactttctt cagttgtttc ccctctttgc ccctggcggc cgctgcggct 

3 901 tggaagacca aacaaagttg ttgggcctcc agacgtagag agcaacttct 

3 951 tcgagagatg caacagatgg ccagccgtcc ctttgcctct gtaaacgtcg 

4001 ccttggaaac agatgaggag cctcctgatc ttatcggggg gagtataaag 

4051 actgttccca aacccattgc actggagccg tgttttggca acaaagccgc 

4101 tgtcctctct gcgcttgtga ggctccctcg aggcccgggt ggcatccctc 

4151 cccccgggca gtcaggtccc gccgtggcca gcgccctggt ggacatttct 

4201 cagcagatgc cgatagtgta caaggagaag tcaggagccg cgagaaaccg 

4251 gaagcagcag ccccccgcac agcctgggac ctgcacctga 
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